Quality Estimation from Scratch
نویسندگان
چکیده
This thesis presents a deep neural network for word-level machine translation quality estimation. The model extends the feedforward multi-layer architecture by [Collobert et al., 2011] to learning continuous space representations for bilingual contexts from scratch. By means of stochastic gradient descent and backpropagation of errors, the model is trained for binary classification of translated words, given only the source sentence and the machine translation. We enhance this model with alignments, and unsupervised pre-training of word representations allows for leveraging large monolingual corpora for supervised quality estimation training. Evaluating it on the data provided by the Workshop on Statistical Machine Translation 2014 and 2015, the model yields competitive results across languages and datasets. A linear combination of the deep model and a shallow linear model trained on baseline features further improves over both individual models. Furthermore, the bilingual word representations learnt during supervised training for quality estimation prove useful for other cross-lingual tasks.
منابع مشابه
QUality Estimation from ScraTCH (QUETCH): Deep Learning for Word-level Translation Quality Estimation
This paper describes the system submitted by the University of Heidelberg to the Shared Task on Word-level Quality Estimation at the 2015 Workshop on Statistical Machine Translation. The submitted system combines a continuous space deep neural network, that learns a bilingual feature representation from scratch, with a linear combination of the manually defined baseline features provided by the...
متن کاملQuesting for Quality Estimation A User Study
Post-Editing of Machine Translation (MT) has become a reality in professional translation workflows. In order to optimize the management of projects that use post-editing and avoid underpayments and mistrust from professional translators, effective tools to assess the quality of Machine Translation (MT) systems need to be put in place. One field of study that could address this problem is Machi...
متن کاملDetection of Line Scratch in Video
Most common defects are flicker, dirt, dust and line scratches.Here we consider line scratch detection.Line scratches appear as thin bright or dark line.This line are usually straight and vertical.The restoration of old videos is based on primary interest because of great quantity of old film records. But manual digital restoration of videos is time consuming process. To detect the scratch in f...
متن کاملQuality Hound - An online code smell analyzer for scratch programs
In this showpiece, we demonstrate the functionality of Quality Hound — an online program analysis tool that takes as input a Scratch project and presents to the user a visual representation of the detected quality problems. Made accessible via a browser-based user interface, Quality Hound is instantaneously accessible to any Scratch user all over the world. The design of Quality Hound is inform...
متن کاملPrediction of aqueous solubility from SCRATCH.
This study proposes the SCRATCH model for the aqueous solubility estimation of a compound directly from its structure. The algorithm utilizes predicted melting points and predicted aqueous activity coefficients. It uses two additive, constitutive molecular descriptors (enthalpy of melting and aqueous activity coefficient) and two non-additive molecular descriptors (symmetry and flexibility). Th...
متن کامل